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(^ introduced. Some of these new quarks can be relatively light, with a mass of a few hundred 

^— ^ GeV, and could be observed with the early LHC collision data expected to be collected 

;• during 2010. We analyse in detail the collider signatures that these new quarks can produce. 

We show that final states with two (same-sign) or three leptons are the most promising 

discovery channels. They can yield a ha excess over the Standard Model expectation 

Cd already with the 2010 LHC collision data. Exotic quarks of charge 5/3 are a distinctive 

feature of this model. We present a new method to reconstruct their masses from their 

leptonic decay without relying on jets in the final state. 
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1. Introduction 

The Standard Model (SM) of particle physics has so far been experimentally confirmed in 
many of its aspects. Yet, a fundamental piece is still missing; namely, the understanding of 
the mechanism responsible for the breaking of the SU{2)l x U{1)y electroweak symmetry. 
The 'minimal' description provided in the SM consists in the introduction of a complex 
scalar doublet if. Electroweak symmetry breaking (EWSB) is achieved assuming that this 
field acquires a non-zero vacuum expectation value. After EWSB, only one physical degree 



of freedom survives: the Higgs boson. Experimental results point towards a relatively light 
particle. If the Higgs boson exists, it should be within reach of the LHC. 

A light fundamental scalar is not natural, though. Radiative corrections are expected 
to drive its mass close to the Planck scale (or to the scale of onset of some new physics). 
An elegant way to prevent this is through symmetries. The most famous example is su- 
persymmetry, that exploits the cancellation between the contributions given by fermions 
and by bosons to the Higgs self-energy. This is not the only solution. Composite Higgs 
models [1, 2] provide an alternative mechanism to explain the lightness of the Higgs boson. 
In these models the Higgs boson arises as a composite state of some new, strongly inter- 
acting sector. The new sector possesses a global symmetry that is spontaneously broken 
at some scale /. This symmetry breaking gives (at least) four Goldstone bosons that can 
be arranged into a complex SU{2)l doublet, which we identify with the Higgs doublet. 
Upon gauging the electroweak symmetry group, the Higgs boson acquires a potential, and 
hence a mass. Since we are interested in the low-energy regime of this strongly coupled 
theory, we can adopt an effective Lagrangian approach [3]. We will consider the minimal 
symmetry breaking pattern 50(5)/«S'0(4) [4], that also preserves custodial symmetry. 

The Higgs boson is not necessarily the only composite state of the new sector to be 
relatively light. In particular, the mixing of the top with composite quarks can explain 
the large top mass. These composite quarks can give significant contributions to the elec- 
troweak precision observables, thus modifying the region of parameter space that is allowed 
for these models [5, 6, 7, 8, 9, 10]. For this study, we focus on a non-minimal realization 
of this model, where two multiplets of top-partners in the fundamental representation of 
50(5) are introduced [10]. 

The LHC is expected to run throughout this year at a center of mass energy ^/s = 7 TeV, 
opening an unprecedented window for searches of new phenomena in particle physics. The 
first glimpse of new physics could well be due to new heavy quarks, which are a rather 
common feature of Beyond the Standard Model (BSM) scenarios. The discovery potential 
of such heavy quarks has been studied in the context of little and littlest Higgs mod- 
els [11, 12, 13], warped extra dimensions [14, 15], fourth generation quarks [16, 17, 18, 19] 
and generic vector-like quarks in isospin singlets or doublets and with different hyper- 
charge [20]. If new quarks are observed, we will need a way to understand which model 
they point at. For this reason we focus on collider signatures that can be considered dis- 
tinctive of the composite Higgs model under study. In particular, we look for configurations 
in which either two charge 5/3 quarks or a full, almost degenerate 4 of 50(4) lie within 
the reach of the 2010 runs at the LHC. For these distinctive signatures, we discuss the 
phenomenology and study the discovery potential on the basis of 200 pb~ of collision data 
at \/s = 7 TeV. We study the event yield with respect to the SM expectation in various 
multi-lepton channels for different points in the parameter space. We outline a new method 
to reconstruct the mass of a charge 5/3 top-partner exploiting its leptonic decay channel. 
We show that with only about 50 signal events in the same-sign di-lepton final state, this 
method can be used to judge if the signal is mainly due to one charge 5/3 quark or rather 
produced by the contributions from multiple top-partners. 

The paper is organized as follows. In section 2 we review the composite Higgs model 



of ref. [10]. In section 3 we discuss the general features of the phenomenology of the 
two distinctive signatures of the model. In section 4 we describe how the model was 
implemented in an event-generator to allow for consistent event generation within a specific 
point in parameter space. The generation of signal and background samples and the fast 
detector simulation are discussed in section 5. Section 6 is dedicated to the description of 
the discovery potential in multi-lepton final states. We focus on two particularly interesting 
points and discuss their phenomenology and the discovery potential by means of a robust 
cut-based analysis. In section 7, we present a new method to reconstruct the mass of a 
charge 5/3 top-partner via its leptonic decay. 

2. The model 

2.1 The Higgs sector 

We consider a strongly interacting sector that can be described at low energy by a non- 
linear sigma model. The cutoff of this model is A{/v = Airf/^/Nc where Nq is the number 
of Goldstone bosons and / is the scale at which the SO (5) — t- 50(4) breaking occurs. This 
scale is assumed to be larger than the EWSB scale v = 174 GeV. Too large values of / 
would introduce a substantial fine-tuning of the model [6]; on the other hand, if the scale of 
new physics is too low, large contributions to electroweak parameters and flavour physics 
are introduced. For these reasons we set / = 500 GeV, which corresponds to a ~ 10% 
fine-tuning [6]. 

The 50(5) — >■ 50(4) breaking is realized through a scalar subject to the constraint 



In the non-linear representation 



^^^-^T^h,V2/f ^ (2.1) 



where (po = (0, 0, 0, 0, /) is the vacuum state that preserves 50(4), T"- are the four broken 
generators and ha the corresponding Goldstone bosons. Expanding the exponential, we 
get 

</> = /( y sin -, cos - j = (^(f, (j)5J , (2.2) 

where (p = (<^) <^) transforms under the fundamental representation of 50(4) = SU{2)l x 
SU{2)ji and h = yh"^- We denote by ip and (p the SM Higgs doublets with hypercharge 
-1-1/2 and -1/2. Finally, we gauge SU{2)l and the T^ generator of SU{2)r. This explicitly 
breaks the 50(5) symmetry and induces a potential for the Higgs boson, that becomes a 
pseudo-Goldstone boson. Since the potential is generated at loop level, the mass of the 
Higgs boson is expected to be light. Throughout this study we set mh = 120 GeV. 
The usual relation for the mass of the W boson, 

-^ = ^ , v' = l{$') (2.3) 



holds provided that we set 

f^/2(ip)\ vV2 , ^, 

™ [-it) = ^^"- <'■'" 

For Sa = 0, (f> = (j)(), electroweak symmetry remains unbroken and the gauge bosons are 
massless, while Sq, = 1 corresponds to maximal EWSB. 

Higgs compositeness, together with the requirement for canonical normalization of the 
kinetic term, leads to a rescaling of the physical Higgs field by a factor Ca = \J\ — 2v'^ / p. 
This implies an analogous reduction of the couplings between the Higgs and the gauge 
bosons and gives in turn some dependence of the electroweak precision test (EWPT) ob- 
servables on the UV cutoff of the model. In fact, in the SM the Higgs boson regulates the 
logarithmic divergencies of the gauge bosons self-energies. In the heavy Higgs approxima- 
tion, the Peskin-Takeuchi parameters S and T [21] read 

S,T = as,T log TUh + bs,T , (2.5) 

where ttt.^ is the mass of the Higgs boson and as^x, bs,T are constants. The reduction of 
the Higgs boson couplings to the gauge boson spoils the cancellation of the logarithmic 
dependence on the UV cutoff, so that now 

S,T = as,T{cllogmh + sllogAuv) + &s,T ■ (2.6) 

This can be taken into account when one computes EWPT observables by replacing the 
Higgs mass with an effective mass [6] 

2 

w.EWPT,eff = ruh {Kuv/rrihY" ■ (2.7) 

As a consequence, we obtain an extra positive contribution to 5 and a negative contribution 

to T, 

A5 = -^logf%I-^') , AT = --Vlogf^%^') , (2.8) 

where cw is the cosine of the Weinberg angle and rrih^rci is the Higgs mass used in the 
electroweak fit. 

On top of this, one can expect the strongly coupled dynamics itself to affect EWPT 
observables through some higher-dimensional operator. This model includes custodial sym- 
metry to protect the T parameter. A reasonable estimate of the contribution to 5 is [6] 

-^^-S^$^--(^)^- 

Combining the effect from Higgs compositeness and higher-order operators, one typ- 
ically obtains too large contributions to the S and T parameters and the model is not 
compatible with current EWPT constraints [6, 9, 10]. Yet, one can expect other composite 
states to be as well below the cutoff of the effective theory. Here we will consider the case 
of fermionic resonances and analyze how they can improve the agreement of the model 
with observations. 



2.2 The fermionic sector 

We consider vector-like resonances of composite fermions transforming in the fundamen- 
tal representation of SO{5). We denote them by ^*, with the index i running over the 
multiplets included below the cutoff. The corresponding mass Lagrangian is [9, 10] 



-C 



SO(5) = M,^*^* + ^(*V)('A^^'), 



(2.10) 



where yij is a Hermitian matrix. Under the electroweak gauge group, ^ decomposes as 
^ = {Q,X,T), where Q and X are SU{2)l doublets with hypercharge +1/6 and +7/6 
respectively, and T is a SU{2)l singlet with hypercharge 2/3. The X doublet introduces 
another quark of electromagnetic charge 2/3, which can mix with the top, and a quark 
with charge 5/3. Such quarks are one of the distinguishing features of the model. The SM 
quarks ql and tj^ have the same quantum numbers as Q and T, respectively. The most 
generic interaction between the top sector and the new quarks is therefore of the form 



-Ant = ^WQr + ^RTltR + h.C. . 

Combining eqs. (2.10) and (2.11) we obtain the mass matrices 
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for the quarks of charge 2/3 and 
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(2.13) 



for the quarks of charge -1/3. The indices u and d denote respectively the charge 2/3 and 
-1/3 components of the doublet indicated. In the case of more fermionic resonances, the 
mass matrices are to be understood as in block form. Note that in eq. (2.13) we introduced 
an explicit 50(5) breaking term 



C = XbqL^hR 



(2.14) 



to give a mass to the bottom quark. We could also generate a mass for the bottom quark 
in an 5*0(5) preserving fashion. For example, we could couple the bottom quark to some 
new multiplets of 50(5), as we did for the top quark. This would come at the expense of 
introducing extra particles. Since the mass of the bottom quark is small, we do not expect 
large effects from bottom compositeness. We opt therefore for a minimal description, in 
which the bottom mass is generated with the current particle content of the model. 

The couplings of the fermions to the Higgs boson are obtained expanding the second 
term in eq. (2.10) around the vev of (j). For example, the couplings of the charge 2/3 quarks 



to the Higgs boson are given by 
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V2 v^ 

Here we already included the suppression factor Cq in the Higgs couplings. 

As it was emphasized in refs. [6, 8, 9, 10, 22], composite Higgs models with only one 
set of fermionic resonances below the cutoff of the effective theory are very constrained 
from EWPT. The charge 5/3 quark is the lightest new particle predicted, with a mass 
771,5/3 ^ 500 GeV. Above it and rather close in mass (Am < 100 GeV) is a charge 2/3 
quark. The other quarks are typically much heavier. The most relevant collider signatures 
therefore come from the production and decay of the charge 5/3 quark. These signatures 
have been studied in detail in [23, 24]. 

The scenario dramatically changes if we include a second set of composite fermions 
below the cutoff [10, 25] . Constraints from EWPT become less stringent, and many different 
mass patterns are allowed in the region accessible with first LHC data. In the next sections 
we discuss the collider signatures and discovery potential of this model. 

2.3 Parameter-space scan 

We scan over the parameter space of the model in order to find regions compatible with 
EWPT. From eqs. (2.10)-(2.14) we see that there are 6 variables parametrizing the fermionic 
sector with one multiplet below the cutoff, and 11 for the case of two multiplets. In both 
cases, Sa is fixed through eq. (2.4), as we have v = 174 GeV and / = 500 GeV. We fix 
other two parameters in such a way to obtain the measured top and bottom masses [26, 27] 



mt = 172.4 GeV 



and 



mb = 4.2 GeV . 



This is more easily done if we factor out of the mass Lagrangian (2.12) (or eq. (2.13) 
for the bottom quark) one of the parameters, say M\ (A;,). Then we diagonalize the 
remaining part of the mass matrix and fix Mi (At) so that the mass of the lightest quark 
is mt (mi,). We are left with eight free parameters in the case of the two- multiplet model. 
We require that the resulting quarks contribute to EWPT observables in such a way to 
make the model compatible with observations. We use the same fit as in [10] to assess the 
agreement between a point in parameter space and experimental constraints. Furthermore, 
we exploit the value of x^ that parametrizes this comparison in order to drive our Vegas- 
based analysis [28]. The procedure is the following. We use Vegas to randomly sample 
on the eight-dimensional parameter space. For each point sampled, the value returned to 
Vegas as an 'integrand' is 1/x^- By construction, Vegas will focus its sampling on the 
points that lead to a higher value of the integrand l/x^? i-e., to a better agreement with 
EWPT. We retain points that are compatible with EWPT at 99% C.L. . 

We further refine our search asking for signals which are characteristic of the two- 
multiplet model. As we said, in the case of only one multiplet below the cutoff, the mass 



spectrum of the new resonances is typically rather spread out. The charge 5/3 quark 
has a mass of some few hundred GeV, while the charge -1/3 quark is very close to the 
cutoff. A signature of a two-multiplet model would then be a charge 5/3 quark in a 4 of 
5*0(4), i.e. very close in mass to two charge 2/3 and one charge -1/3 quarks. We require 
the mass difference among these particles to be < 60 GeV, so that decays through off- 
shell gauge bosons are strongly suppressed. Another typical signature of the model is the 
presence of both the charge 5/3 quarks. We take these two signatures as neat indications 
of this particular model and focus on their discovery potential with early LHC data. With 
200 pb~^ of collision data at 7 TeV, a significant number of quarks with masses below 
~ 500 GeV should be produced^. We will set this value as an upper bound in our search 
for the two distinctive patterns that we just discussed. Direct searches have set lower 
bounds for the mass of new quarks. We use the most recent results from Tevatron on the 
exclusion of a charge 5/3 top-partner [29]. For this quark, the only decay channel is tW^, 
as in the reference. We do not use instead the most stringent exclusion bounds on the 
charge -1/3 and 2/3 quarks, [29, 30] and [31], as they assume the new quarks to decay 
entirely through either W or Z. This is not the case in our model. Therefore, as lower 
mass bounds we set 

7715/3 > 365 GeV , 'm2/s, w,-i/3 > 260 GeV . 

3. Phenomenology of the two-multiplet composite Higgs model 

In this section, we outline some of the basic features of the phenomenology that we expect 
from the two-multiplet model. This phenomenology is largely determined by the mass 
hierarchy of the 10 new quarks. The mass eigenstates (ordered according to increasing 
mass) of the new top-like quarks will be named ti, t2, ts, ^4, ts and t^, whereas the charge 
5/3 quarks and the bottom- like quarks will be denoted as xi, X2 and 6i, b2, respectively. 

Dominant decay modes. The process gg — t- qq plays the dominant role in the produc- 
tion of heavy top partners at the LHC. Therefore, we analyse the decay chains that start 
from pair produced quarks. Figures 1 and 2 show two Feynman diagrams for a possible 
decay chain of a tiii and a xixi pair. 

In all points of the parameter space that satisfy the selection criteria of section 2.3, 
the two lightest new quarks are xi and ti. The signatures from this model that could be 
observed early at the LHC will be therefore dominated by the decay modes of these two 
quarks. We also find that their mass difference is always too small for the heavier of the 
two to decay into the lighter. Consequently, only the following channels are accessible for 
the decay of the two new quarks 

h -^tZ ti-^ th ti -^ bW^ xi -^ tW^ . (3.1) 



^For reasons that we will explain later, we focus on decay channels which produce at least two charged 
leptons in the final state. We estimate a leading order cross section of 207.8 ± 0.5 fb for pair production of 
a quark with a mass of 500 GeV at ^/s — 7 TeV (the stated uncertainty is due to statistics only). Taking 
into account the branching ratio for the W and Z bosons to decay leptonically, we cannot expect to observe 
more than a handful of events in the considered channels for an integrated luminosity of 200 pb~^. 
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Figure 1: Example Feynman diagram for ti 
pair production with a possible decay chain. 




Figure 2: Similar diagram for xi pair pro- 
duction. 



The lightest bottom-like quark bi, which we always find to be heavier than ti and xi, 
decays predominantly via 

bi -^ tW~ . (3.2) 

The other possible decays, 6i — )• bZ and 6i — )• bh, are strongly suppressed because of 
the small off-diagonal couplings. Such small mixing is a consequence of the fact that 
the bottom quark is mainly fundamental. We find that the decay 6i — t- tiW~ is not 
kinematically accessible. 

Phenomenology of the 4 of SO(4). We consider xi, ti, t2 and bi to form a 4 of 5'0(4) 
when the maximal mass difference among the quarks is < 60 GeV. In this way none of the 
new quarks can decay into another one, since decays through the W, Z and h bosons are 
not kinematically allowed. Consequently, all these four new quarks can only decay to the 
SM top and bottom quarks. 

Phenomenology of the XX. The phenomenology can be much richer if both charge 5/3 
quarks are below 500 GeV and no restriction on the maximal mass difference among the new 
quarks is imposed. However, the exclusion limits from the CDF experiment in combination 
with the upper bound of 500 GeV for early detection imposes strong restrictions on the 
cascade decays that are kinematically allowed. Often, the mass differences of these quarks 
are such that they only decay via the channels given in (3.1) and (3.2). The two lightest 
quarks are xi and ti, where either of the two can be the lighter one. Going up in mass, 
we find 6i and t2, or t2 and bi. The next heavier quark is either X2 followed by t^, or vice 
versa. The most common hierarchy is 



ruxi < "iti < "ifoi < '>^t2 < rn. 



X2 



A rarer mass pattern is 



mil < "^xi < "2*2 < m-bi < mtg < mx2 < "^t4 



(3.3) 



(3.4) 



The quarks that do not appear in these relations have masses above 500 GeV. An example 
for a cascade decay accessible for various points is 

t2 -^ xiW- -^ tW+W- -^ hW+W+W- . (3.5) 

Both this cascade decay and the dominant decay modes from eqs. (3.1) and (3.2) suggest 
that the model can easily yield multi-lepton final states plus many jets. The SM is expected 
to produce only few events with such a signature. For this reason, we focus our study on 
final states with at least two leptons and multiple jets. 

4. Model implementation in MadGraph and categorization of points 

4.1 Implementation 

We implement the model in MadGraph/MadEvent 4 (MG/ME) [32]. MG/ME is a matrix- 
element based tree- level event generator that is capable of generating amplitudes and events 
for any given model describing high energy physics interactions. For such an event genera- 
tor to be able to cope with a new physics model, the couplings and interactions of the new 
particles as defined in section 2.2, in addition to the (modified) Standard Model interac- 
tions, have to be translated into a specific form. In MG/ME these couplings are defined 
according to the convention from HELAS [33] . The implementation of these couplings is 
done by means of the usermod vl framework. The decay widths and branching ratios of 
all unstable particles are calculated with BRIDGE [34]. 

We implement the model taking into account not only the couplings of the newly 
introduced particles, but also the changes in the Standard Model couplings arising from 
Higgs compositeness and from the mixing of the SM quarks with the new states. 

4.2 Benchmark points in the composite Higgs model parameter space 

Since the scan over the parameter space was optimized to search for points that satisfy the 
selection criteria of section 2.3, the points returned are not necessarily very different from 
each other. For this reason, we arrange the points in groups that are expected to exhibit 
a similar phenomenology and focus on the representatives of these groups for a detailed 
study. We assign two points to the same group if all branching ratios of the new quarks 
with a mass below 500 GeV are of similar magnitude. When a group contains more than 
one point, we use the mass of the lightest new quark m^jow to select two representatives: 
the point with the lowest value of rrigiow and the one with largest value of mgiow In the 
following, these two representatives will be referred to as low benchmark point (IBP) and 
high benchmark point (hBP) of a group. For the discussion of the discovery potential, we 
will restrict ourselves to the 30 benchmark points obtained in this way. 

5. Event generation and detector simulation 

5.1 Event generation 

For each benchmark point, we produce 10^ signal events with MG/ME. In particular, we 
generate events for pair production of all new quarks that have a mass below 500 GeV. 
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IBP 1 


XX 


xi: 


365.6 


4.01 


IBP 2 


XX 


xi: 


366.0 


3.85 


hBP 1 


XX, 4 


xi: 


429.8 


2.12 


hBP 2 


XX, 4 


xi: 


408.9 


2.50 


IBP 3 


XX, 4 


xi: 


366.1 


7.90 


IBP 4 


XX, 4 


xi: 


400.9 


4.45 


hBP 3 


XX, 4 


xi: 


404.0 


3.94 


hBP 4 


XX, 4 


xi: 


462.9 


1.61 


BP5 


XX 


xi: 


366.7 


3.45 


BP 6 


XX, 4 


xi: 


463.9 


1.57 


BP 7 


XX, 4 


xi: 


461.5 


1.70 


BP 8 


XX, 4 


xi: 


456.9 


1.58 


BP9 


XX 


xi: 


365.7 


3.88 


BP 10 


XX, 4 


ti: 


316.6 


10.78 


IBP 11 


XX, 4 


xi: 


377.5 


6.20 


IBP 12 


XX 


xi: 


367.7 


4.29 


hBP 11 


XX, 4 


h: 


393.3 


3.86 


hBP 12 


XX, 4 


xi: 


391.5 


5.52 


BP 13 


XX, 4 


xi: 


373.5 


7.80 


BP 14 


XX, 4 


ti: 


343.4 


7.91 


IBP 15 


4 


xi: 


365.8 


5.58 


IBP 16 


4 


xi: 


365.4 


5.18 


hBP 15 


4 


xi: 


376.7 


4.40 


hBP 16 


4 


xi: 


450.2 


1.43 


IBP 17 


4 


xi: 


365.9 


5.23 


IBP 18 


4 


xi: 


365.2 


5.52 


hBP 17 


4 


xi: 


438.3 


1.66 


hBP 18 


4 


xi: 


414.2 


2.53 


BP 19 


4 


h: 


375.3 


4.92 


BP20 


4 


ii: 


379.8 


4.35 



Table 1: The 30 benchmark points of the two multiplet model with the mass of the lightest quark 
iTiq,\ow and the total cross section for pair production of all quarks below 500 GeV. The cross sections 
are calculated at leading order for y/s = 7 TcV. 



The outcome of the MG/ME event generation is a Les Houches event file [35], which we 
process with Pythia 6 [36] for the showering and hadronization of the partonic events and 
for the simulation of the underlying event. Table 1 lists the mass of the lightest particle 
rrig low and the total leading order cross section for pair production of all considered quarks 
for each benchmark point. 

As already mentioned in section 3, we focus on final state signatures with at least 
two charged leptons and multiple jets. Consequently, every SM process that can lead to 
such final states represents a possible background. Table 2 lists the leading order cross 
section and the number of generated events for all relevant background processes. Note 
that single top production was neglected for this study. Its contribution is expected to be 
within the uncertainty of the pair production cross section. In order to estimate correctly 
the momentum spectrum of the jets in the transverse plane of the detector, we generate 
all partonic multiplicities needed for the SM backgrounds in MG/ME and use Pythia 
for the parton shower. The overlap between the phase-space description of the matrix- 
element calculator and the parton shower is removed using the MLM parton-jet matching 
prescription [37]. For the signal samples, the jets produced by the parton shower in the 
decay of very heavy particles are known to be satisfactory [38]. The underlying event 
is simulated with Pythia. For all samples, we use the parton distribution function set 
CTEQ6L1. 

We would like to point out that the samples for the background processes were gen- 
erated within the SM. We did not take into account the changes of the SM couplings 
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process cross section (pb) # of generated events 
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+ <3j 


2400 


1302405* 


w 


+ <3j 


24170 


12270142* 


vv 


+ <lj 


4.8 


113764* 


w^w^ 


+2j 


0.2119 


47070 


W+W'W^ 




0.04105 


49999 


ti 


+ <3j 


95 


1395630* 


tiW^ 


+ <lj 


0.1687 


66266 


ttz 




0.1038 


49999 



Table 2: Background processes with the corresponding cross section and the number of generated 
events. The di-vector boson sample VV + jets includes all processes with two W or Z bosons, 
except for the case of two W bosons with the same charge. In the first three samples, the vector 
bosons are forced to decay leptonically. The (*) indicates that we used the MG/ME Les Houches 
events made available from the LCG Monte-Carlo Data Base [39] instead of generating the events 
ourselves. 



introduced in the composite Higgs model. These modifications differ for each point in the 
parameter space of the model, but we expect the resulting effects on the SM backgrounds to 
be small. Also note that we only consider pair production of the new quarks for the signal 
samples. We neglect the contributions of other processes (such as single quark production) 
to the signal yield in multi-lepton final states. These additional contributions to the signal 
would enhance the excess over the SM expectation. 

5.2 Detector simulation 

We use DELPHES [40] for the simulation of the response of a typical LHC detector. 
DELPHES is a recently developed simulation framework for a generic collider experiment. 
As CMS is one of the two general purpose detectors at the LHC, we use the CMS detector 
card for the DELPHES detector simulation. We reconstruct the jets with the anti-/cf jet- 
clustering algorithm [41] and use a cone radius Ai? = y A^^ + Ar/^ = 0.5. (j) denotes the 
azimuthal angle and the pseudorapidity rj is defined as 77 = — In tan ^ , where 9 is the angle 
between the beam pipe and the trajectory of the particle. To adapt the performance of 
DELPHES to our needs, we make the following modifications. 

• In DELPHES, the possibility of a jet being reconstructed as an electron is not taken 
into account. This, however, is expected to be a relevant source of fake electrons. 
In ref. [42], the probability for a jet to be reconstructed as an isolated, identified 
electron is estimated to be at a level of 6 • 10~^. We use this result and add jets to 
the isolated electron collection with the stated global probability. 

• We set the global tracking efficiency to 100% for tracks with a transverse momen- 
tum of at least 0.9 GeV, but remove electrons from the electron collection with a 
probabihty of 10%. 
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5.3 Lepton and jet identification 

We outline a robust and simple event selection that is suitable for early data from the LHC. 

Charged lepton selection. For the electrons and the muons, we demand a transverse 
momentum px > 20 GeV and a pseudorapidity \r]\ < 2.4. The first cut ensures a robust 
identification of electrons and muons, both offline and on trigger level, whereas the second 
cut is made to restrict the leptons to the volume of the tracker. For this study, we are 
interested in prompt leptons coming from vector boson decays. To discriminate against 
leptons coming from semileptonic hadron decays, we apply a relative isolation. In partic- 
ular, we sum the pT of the tracks in a cone with AR < 0.3 around the electron or muon 
under analysis and require this value to be smaller than 5% of the lepton momentum. 

Jet selection. To obtain a robust jet selection, we demand the pT of a jet to be larger 
than 50 GeV and require |ry| < 3. The conservative choice of pT > 50 GeV is made to 
minimize the contribution of fake jets. The second cut marks the end of the electromagnetic 
and hadronic calorimeters. As electrons may be reconstructed as possible jet candidates, 
we reject those jets that are matched within AR < 0.2 to an isolated electron. The jet 
collection can be further cleaned from such electrons by requiring that the jets should have 
an electromagnetic fraction (electromagnetic over hadronic energy deposits) of less than 
0.98. 

Purity and efficiency of the lepton selection. Imposing a harder cut on the lepton 
isolation enhances the purity of the selection but causes the efficiency to decrease. The 
goal is to achieve a pure selection of prompt leptons without losing too much efficiency. By 
purity we define the number of isolated leptons matched to prompt MC leptons divided by 
the number of isolated leptons. The efficiency is defined as the number of isolated leptons 
divided by the number of MC prompt leptons. The number of matched isolated leptons is 
obtained by counting the ones that satisfy both criteria: 

• they have a prompt MC lepton within a cone of AR < 0.2 

• the equation '''^■"""^^■"^l < 0.2 holds. 

Pt,mc 

For the ti sample from table 2, we find an efficiency of 83% and a purity of 97% for the 
electrons. For the muons, we obtain an efficiency of 91% and a purity of 99%. 

6. Discovery potential at the LHC 

6.1 Identification of promising channels 

After applying the lepton and jet selection defined in section 5.3, we investigate the number 
of events for a given jet multiplicity and lepton configuration for each background and signal 
process. The lepton configurations go from di-lepton events - same-sign (SS) or opposite- 
sign (OS) - to events with up to five charged leptons in the final state. Each configuration, 
which is characterized by a certain lepton combination and jet multiplicity, is interpreted 
as a specific signal region with an associated cut efficiency. Since these cut efficiencies are 
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= ^o■' 3i 



2I05 5F 21 OS OF 31 SS 31 OS 




21 SS 21 OS SF 21 OS OF 31 SS 31 OS 41 51 



Figure 3: The upper limit for the total num- 
ber of background events based on a 95% 
confidence level. This plot includes all rel- 
evant backgrounds scaled for an integrated 
luminosity of 200 pb~ . The y-axis shows 
the jet multiplicity, whereas the lepton con- 
figuration is given on the x-axis. 



Figure 4: The same plot as on the left hand 
side, but showing the lower limit for the ex- 
pected number of signal events for BP 10. 



based on a finite MC statistics, we observe certain configurations with non-vanishing signal 
but zero background events. To avoid this issue, we calculate for all configurations lower 
and upper limits for the cut efficiency for the signals and backgrounds respectively, based 
on a 95% confidence level. This can be interpreted as a worst case scenario for the discovery 
of the model. The expected number of signal and background events are obtained by mul- 
tiplying these efficiencies by the integrated 
luminosity of 200 pb~ times the corre- 
sponding cross section (as listed in tables 1 
and 2). 

In figures 3 and 4 we plot the jet mul- 
tiplicity versus the lepton configuration, 
respectively for the total SM background 
and for the signal for BP 10. We denote 
by SS the configurations in which all the 
leptons have the same charge. Configura- 
tions in which at least one lepton has a 
different charge are denoted by OS. In the 
OS di-lepton case, we also distinguish be- 
tween the opposite-flavor (OF) and same- 
flavor (SF) configurations. For the bins 
in figure 3 for which zero MC background 
events were found, we calculate an upper 
limit of 2.13 background events with a confidence of 95%. This number is dominated by 




21 OS SF 21 OS OF 31 SS 



Figure 5: The expected number of signal events 
for BP 10 divided by the total number of events 
from the SM background. 



13 



the contributions from W + jets and Z + jets due to their large cross sections and the 
Hmited MC statistics. In figure 5, we plot the number of signal events for BP 10 divided 
by the total number of background events. In terms of number of expected events over 
the SM background, we can see that the final states with SS di-leptons and OS tri-leptons 
are the most promising channels for a possible discovery with 200 pb~ of collision data. 
This observation holds for all the 30 benchmark points. The decrease in the plotted S/B 
ratio for large jet multiplicities is not expected in collision data. This effect is due to the 
combination of a finite number of MC events with the calculation of an upper limit on the 
number of background events. In the light of the above discussion, we will focus on the 
four channels: SS di-lepton with 3 or 4 jets and OS tri-lepton with 2 or 3 jets. 

6.2 Inclusive discovery potential 

In order to quantify the discovery reach in the four channels above, we calculate the prob- 
ability for the expected signal + background observation to be caused by a fiuctuation in 
the background distribution. We use 21ogX as a test statistic, where X is the ratio of the 
likelihood function for the signal + background hypothesis Hi to the likelihood function 
for the background hypothesis Hq [43, 44]. 

The likelihood ratio Xi for the channel i can be defined as 

X. = ^ , (6.1) 

where 

LHt,i = ^^^j , LhoA = j;-^^ • (o.ij 

Here, Si and bi denote the number of signal and background events, respectively, and di is 
the number of observed candidates. Since the statistic 21ogX for the outcome of multiple 
channels is the sum of the test statistics of the channels separately, we use 2 X^j^^ log Xi 
for the combined four channels defined in section 6.1. We define the confidence level as 

CLb = Pb{X < Xobs) , (6.3) 

where the probability sum assumes the presence of the background only. Note that the 
background confidence 1 — CLb expresses the compatibility of the observation with the 
background hypothesis, since CLb is the probability that the background processes would 
give fewer than or equal to the number of events observed. For this reason, we use CLb to 
quantify the discovery potential. The background confidence 1 — CLb can be compared with 
the widely used notion of standard deviations (a) by using the convention from ref. [27]^. 
There, a 3cr and 5a excess beyond the background expectation corresponds to a one-sided 
background confidence level of 1 — CLb = 1.35 • 10~^ and 1 — CLb = 2.87- 10~^ respectively. 
The distribution of the test statistic for Hq and Hi, often referred to as the test 
statistic probability density function (tPDF), are obtained by throwing Poisson numbers 
around Si + bi and bi as a replacement for di. The confidence level CLb and its uncertainty 



^See e.g. table 32.1 
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process 21 SS + 3j 21 SS + 4j 31 OS + 2j 31 OS + 3j 



Z + jets 


1.7 


1.0 


1.7 


1.7 


W^ + jets 


1.8 


1.1 


1.1 


1.1 


yy +jets 


0.075 


0.023 


0.49 


0.12 


W^W^jj 


0.099 


0.019 


0.0024 


0.0024 


W+W'W^ 


0.0035 


0.00044 


0.0012 


0.00044 


ii + jets 


2.1 


0.83 


0.52 


0.25 


ttW^j 


0.19 


0.075 


0.052 


0.016 


ttz^ 


0.036 


0.011 


0.085 


0.063 



total 5.90 3.02 3.86 3.16 

Table 3: The upper limit on the number of expected events for 200 pb^^ of data for each of the 
background processes at a confidence level of 95%. Systematic uncertainties on cross sections are 
not taken into account. 

is calculated as follows. In the presence of data, CLb is given by the integral of the tPDF 
of the background hypothesis from — oo to the measured value of 21ogX. For this study, 
we replace this value by the mean of the tPDF for the signal + background hypothesis 
to substitute collision data. The uncertainty on CLt, is then obtained by changing the 
integration limit to the mean plus/minus one standard deviation of the signal + background 
tPDF. To claim a 5o" excess over the background expectation, we have to be sensitive to 
CLb at the order of 10~ . For this reason, we generate 10^ pseudo-experiments for each of 
the tPDFs for Hq and Hi. 

In table 3 we list the expected number of events for all background processes for the 
four channels considered. The corresponding values for the 30 signal benchmark points, 
including the results for the confidence level CLb, are given in table 7 of appendix A. Even 
in the worst case scenario, we expect a signal evidence of at least 3o" for 23 benchmark 
points. For 10 points among these 23, the central CLb value corresponds to an excess over 
the SM expectation of at least ba. 

6.3 Discovery potential of tw^o benchmark points 

We now focus on the discovery potential of two promising benchmark points. One is BP 10, 
which has both xi and X2 below 500 GeV; the other is IBP 18. Both benchmark points 
exhibit a 4 of 50(4) and have large cross sections (10.78 pb and 5.52 pb respectively), 
yielding a relevant excess over the SM background. We use a simple cut-based analysis 
and outline some features of their specific phenomenology. 

Phenomenology of the two benchmark points. As we can see from table 1, the 
lightest new quark for BP 10 is the top-like ti with a mass of 316.6 GeV. The full mass 
hierarchy for the new quarks with a mass below 500 GeV reads 

mt^ (316.6) < m^, (365.2) < mt^ (374.4) < nn,^ (377.3) 
< mt^ (377.9) < ma;2 (395.3) < nit^ (473.3) , 
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where the masses are given in GeV. For this point, the mass difference between the t4 quark 
and the other quarks is such as to allow the t^ to decay into most of them. The full list of 
branching ratios for all the above listed quarks can be seen in table 4. 
For IBP 18, the mass hierarchy is 



m 



n (365.2) < rrit, (367.6) < nib, (373.9) < mj^ (403.1) 



Given that the maximal mass difference among these quarks is about 40 GeV, their de- 
cay modes are described by eqs. (3.1) and (3.2). In table 5 we list the branching ratios 
corresponding to these decay modes. 

BP 10: 



BR(ii ^ bW+y. 2.74 
BR(ti^iZ): 1.68 
BR(ii -^ th): 5.58 


10-1 
10-1 
10-1 


BR(t2 -^ bW+): 
BR(t2 -^tZ): 
BR(t2 -^th): 


5.47 
3.80 
6.20 


10-5 
10-1 
10-1 


BR(t3 -^ bW+) 
BR(t3 -^ tZ): 
BR(t3 -^ th): 


2.09 
8.95 
8.17 


10-2 
10-1 
10-2 


BR(t4 -^ 6VF+): 6.89 
BR{t4^biW+): 1.17 
BR(t4 -^ xiW-): 3.01 


10-2 
10-2 
10-2 


BR{t4^tZ): 
BR(t4 -^ tiZ): 
BR(t4 -^ t2Z): 


3.56 
7.03 
1.38 


10-1 
10-2 
10-3 


BR{ti^t^Z): 
BR(t4 -^ th): 
BR(t4^ti/i): 


5.98 
4.42 
1.02 


10-3 
10-1 

10-2 


BR{bi -^ tW): 9.96 


10-1 


BR(6i -^ bZ): 


3.78 


10-5 


BR(6i -^ bh): 


2.29 


10-5 


BR{xi ^tW+): 1.00 


BR(x2 -^ tW+) 


9.97 


10-1 





Table 4: The branching ratios for the seven quarks with a mass below 500 GeV for BP 10. Note 
that the branching ratios may not add up to 1 as possible three-body decays might contribute. 



IBP 18: 



BR(ti -^ bW+): 1.56 • 10-2 
BR(ti -^ tZ): 9.48 • IQ-i 
BR{ti-fth): 3.64-10-2 




BR(6i -^tW-): 1.00 
BR(6i -^ bZ): 4.06 • 10-5 
BR{bi^bh): 2.47-10-5 



BR(a;i -^ tW+): 1.00 



Table 5: The branching ratios for the four quarks with a mass below 500 GeV for IBP 18. 

Cut-based analysis. We outline a simple, cut-based analysis for the two benchmark 
points to illustrate a complementary way to investigate the discovery potential of the 
model. For this analysis we use the lepton and jet selections defined in section 5.3. Given 
the results from tables 3 and 7, we ask as preselection to have at least two same-sign 
(isolated) leptons (e, n) with px > 20 GeV and \r]\ < 2.4. 

Figure 6 shows the expected number of jets per event for BP 10, IBP 18 and the SM 
background after preselection for an integrated luminosity of 200 pb- . Based on these 
distributions we impose a cut on at least 2 jets, where the jets are requested to have 
Pt > 50 GeV. As a next step, we make use of the variable /ly, which is defined as the 
scalar sum of the transverse momentum of the selected jets and leptons per event. In 
figure 7 we show the overlaid distributions of hx for BP 10 (red), IBP 18 (blue) and the 
SM background (grey) scaled for 200 pb- of data. The distributions were obtained only 
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Figure 6: Number of jets per event for 
BP 10 (red), IBP 18 (blue) and the SM back- 
ground as from table 2 (gray). The plot is 
scaled to give the number of expected events 
for an integrated luminosity of 200 pb^ . 
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Figure 7: The scalar sum of the -pT of all 
selected leptons and jets per event after pre- 
selection. The scaling and the color code is 
the same as on the left. 



after imposing the preselection cut. Clearly, this variable can be used as a powerful cut to 
suppress the background contribution. For this reason, we require an hj' > 300 GeV for 
the events to pass this cut. From figure 8 we see that the signal distributions of the pr of 
the hardest jet peak at larger values than the corresponding SM background distribution. 
Consequently, we impose a cut at 90 GeV on this variable. Summarizing, we impose the 
following cuts: 

1. at least 2 jets with px > 50 GeV, 

2. /it > 300 GeV and 

3. pt of the leading jet > 90 GeV. 

In table 6 we list the efficiencies of the preselection and of the superposition of all 
cuts for the two signal samples and for each of the background processes. The efficiency 
of the cuts has been studied individually. Moreover, we list the expected number of events 
after having superimposed all cuts. We find that we can expect 62 and 41 events for an 
integrated luminosity of 200 pb^ for BP 10 and IBP 18, but only a total of 6.7 events 
arising from the SM backgrounds. To estimate how much integrated luminosity we need 
to obtain a 5a excess over the SM expectation, we again make use of the log likelihood 
ratios. In particular, we calculate the background confidence level 1 — CLb and require it 
to be smaller than the 5a probability of 2.87 • 10~^. For IBP 18 we find a 5a significance 
for an integrated luminosity of 46!t22 P^^ with an expected number of 9.52]*^^'55 signal 
and 1.54^'jnV^ background events. For BP 10 we find that a 5a excess is expected for an 



-0.74 



integrated luminosity of 241^2 pb , which corresponds to 7A2^^'j^ signal and O.SO^g 
background events. The uncertainties are obtained in the same way as in section 6.2. 



0.55 
40 
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sample 


preselection eff. 


total selection eff. 


expected # of events 


Z + jets 


7.22 


10-5 


1.54 


10-6 


„ .+0.74 
^•''±-0.37 


W + jets 


2.97 


10-5 


1.63 


10-7 


n 79+0.79 


FF + jets 


2.19 


10-2 


6.15 


10-4 


59+0-07 


w^w^n 


2.32 


10-2 


9.92 


10-3 


n 49+0-02 


W+W'W^ 


2.24 


10-2 


1.26 


10-3 


U.UiU_o.ooi 


ii + jets 


8.67 


10-^ 


1.89 


10-4 


o fj+0.2 
c5.D_Q2 


ttW^j 


2.44 


10-2 


1.11 


10-2 


^7+0-01 


ttz^ 


1.67 


10-2 


8.24 


10-3 


1 7+0-01 


BP 10 


3.79 


10-2 


2.87 


10-2 


61.8+l-J 


IBP 18 


4.97 


10-2 


3.75 


10-2 


41.41°:? 



Table 6: The efficiency of the preselection cut, the total cut efficiency and number of expected 
events for each background and the two signal samples for an integrated luminosity of 200 pb- . 
The stated uncertainty on the number of expected events corresponds to the 68.3% confidence 
interval of this number. The total background sums up to 6.7 events. 

Systematic errors are not taken into account. These results show that a discovery of this 
model may already be feasible at the LHC with only a few dozen inverse picobarns of 
understood collision data. 



7. Reconstructing the mass of a charge 
5/3 top-partner 

Among the top-partners, the charge 5/3 xi 
gives the largest contribution to the excess over 
the SM expectation in the SS di-lepton chan- 
nel. This is due to its low mass and the fact 
that it always decays to tW^, which leads to 



Xl 



tW^ 



bW+W-^ 



bl+l+viui (7.1) 



in the leptonic decay mode. For ti, which is 
the only new quark that could be lighter than 
Xl, only few of its decay modes (eq. (3.1)) pro- 
duce SS di-leptons in the final state. The accu- 
rate mass reconstruction of a charge 5/3 quark 
would be a big step towards the interpretation 



p hardest jet for | L=200 pb 
BP18 

I 1 BP10 

i^ry^.y^j SM background 




150 200 250 300 350 400 450 500 
p^(GeV) 



Figure 8: The px of the hardest jet after 
preselection for BP 10 (red), IBP 18 (blue) 
and the total SM background (grey). 

of the discovery. In the literature, different methods have been proposed for the recon- 
struction of its mass. These methods usually focus on pair production, so that they can 
exploit same-sign di-leptons from the decay of one of the charge 5/3 quarks to select and 
identify the event. The mass is reconstructed using the fully hadronic decay mode of the 
two W bosons coming from the other charge 5/3 quark [20, 23, 24]. In ref. [45] an al- 
ternative method is presented. The mass of a charge 5/3 top-partner is reconstructed in 
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SS di-lepton events via its transverse mass. This transverse mass is computed from the 
momenta of the two SS leptons, the missing transverse energy (from the two neutrinos) and 
the b jet belonging to the semileptonically (and not to the second, hadronically) decaying 
top quark. 

In the following, we outline a new method to reconstruct the mass of a charge 5/3 
quark xi. We exploit the same channel as [45], but we only rely on the two SS leptons 
and use the shape of their invariant mass distribution to reconstruct xi. This avoids b 
tagging inefficiencies and the problem of assigning the correct b jet to the corresponding xi 
decay. We also consider the situation in which an excess of about 50 SS di-lepton events (as 
expected for 200 pb~^ of collision data) is caused by the presence of multiple top-partners. 
In this case, we show how the method can be used to discriminate the signal against a 
hypothesized presence of xi only. 

7.1 Mass determination with 200 pb^^ of collision data 

The method. In the decay of a pair-produced xixi, the SS di-leptons come from the 
same decay leg and the positively (negatively) charged leptons can be assigned to the decay 
of xi (xi). The invariant mass distribution of the SS di-leptons contains information about 
the xi mass. In fact, the endpoint of this invariant mass distribution m^^^ is sufficient 
to determine irixi, since mf^^^ can be expressed in terms of the masses of the particles 
involved in the decay (7.1). The mass of xi is the only unknown parameter in this relation. 
An accurate measurement of this endpoint, however, is not possible with only 200 pb~ 
of collision data. We can use, instead, the shape of the invariant mass distribution to 
determine rux^. 

In ref. [46], an analytic expression for the shape of the invariant mass distribution 
Mic for the supersymmetric decay g ^ i ii, ii — )■ c x? is presented'^. As the kineinatic 
configuration of this decay is identical to eq. (7.1), we can use their results to model the 
shape of the invariant mass distribution of the SS di-leptons from leptonic xi (xi) decays. 
This shape function, however, does not take into account the possibility of a leptonically 
decaying tau-lepton originating from a W decay. Also, an inclusive electron and muon 
spectrum without any selection cuts was assumed. These two assumptions are not satisfied 
in our realistic analysis. A fit of the full invariant mass distribution does therefore not lead 
to an accurate estimation of mx^ ■ However, we find the shape of the tail of the distribution 
to be almost invariant under the effect of the selection cuts and the tau contribution^. A 
fit of the tail of the distribution is thus a powerful means to extract the mass of xi. 

In figure 9 we show the invariant mass distribution of the SS di-leptons from a pair- 
produced xi with a mass of 365 GeV. This is the xi mass for BP 10 and IBP 18. We 
apply the same selection as in section 6.3. Both the signal and the SM background (see 
table 6) are normalized to an integrated luminosity of 200 pb~ . With a leading order 
cross section of 1.64 pb for the signal, we estimate 15.3 it 0.2 SS di-lepton events due to 
xixi. When fitting the tail of the total distribution from the signal plus the SM with the 



^Spin effects were neglected in the calculation of the shape of the invariant mass distribution. 
*The systematic error introduced is < 3%. 
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shape function (starting from the peak of the distribution), we obtain a fitted mass mfit of 
370.0 lb 32.3 GeV. By rescahng the generated distribution with the signal cross section, we 
underestimate the statistical fluctuations in the number of events per bin. The statistical 
uncertainty of about 32 GeV on the fitted mass, however, correctly represents the precision 
expected with about 15 signal events. We conclude that fitting the tail of the invariant 
mass distribution of the signal plus the SM background leads to a fairly accurate estimate 
of the xi mass^. 

The above method assumes the total pro- 
duction cross section of the charge 5/3 top- 
partners to be dominated by pair-production. 
Neglecting the contribution of single quark pro- 
duction allows us to estimate the production 
cross section as a function of the quark mass. 
This neglected contribution affects neither the 
shape nor the endpoint, but changes the ab- 
solute normalization of the invariant mass dis- 
tribution of the SS di-leptons. The cross sec- 
tion for single quark production is typically 
small for relatively light top-partners, but in- 
fluenced by model-dependent electroweak cou- 
plings. For BP 10 and IBP 18 we find that 
the ratio of the leading order cross section for 
single xi production over xiXi pair produc- 
tion is about 5.8% and 2.3%, respectively. For 
these points, the errors introduced are smaller 
than the uncertainty of the next-to-leading or- 
der pair production cross section, which is ap- 
proximately 20% for top-partners with a mass 
of about 500 GeV [47]. 

Applying the method to two benchmark points. For the 4 of 50(4) and the XX 
signatures, the various top-partners in addition to xi contribute to the excess of SS di- 
lepton events and alter the invariant mass distribution. In the special case of BP 10 and 
IBP 18, there is a bottom-like 6i with a mass of about 10 GeV above the xi mass. Since 
it predominantly decays to W~t, it plays an important role for the additional production 
of SS di-leptons. In order to obtain SS (rather than OS) di-leptons from bibi decays, one 
lepton has to come from 6i and the other from bi. Therefore, the invariant mass distribution 
of the SS di-leptons from bibi decays does not show an endpoint, but rather a tail that 
extends far into the high invariant mass region. This is in contrast to the SS di-leptons 
from xixi decays. In case of BP 10, two charge 5/3 quarks below 500 GeV contribute to 
the excess of SS di- lepton events. Since X2 is more massive than xi, the invariant mass 
distribution due to its leptonic decay is broader and has a larger endpoint with respect 
to the xi contribution. The main effects of these additional top-partners (including the 
^When only fitting the signal without the SM background, we find mfn = 372.4 ± 30.3 GeV. 



M„ (QeV) 

Figure 9: Invariant mass of the SS di- 
leptons from leptonic decays of a xi with mass 
365 GeV. The signal is stacked on top of the 
SM. The peak in the SM distribution at an 
invariant mass of about 520 GeV is caused 
by a single Z — >• /+/~ event that passed the 
selection cuts. 
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charge 2/3 quarks) on the invariant mass distribution of the SS di-leptons are an increased 
number of signal events and a large tail that hides the endpoint due to the light xi. These 
effects can be used to determine whether or not the expected SS di-lepton invariant mass 
distributions for BP 10 and IBP 18 can be explained by the hypothesized presence of a 
charge 5/3 top-partner only. 

In figures 10 and 11 we show the invariant mass distribution of the SS di-leptons for 
BP 10 and IBP 18 respectively, as expected to be observed with 200 pb~^ of collision data. 
The SM background for the same integrated luminosity is added to the signal distribution. 
As explained above, a fit of the tail of the distribution leads to a fairly accurate estimate 
of the xi mass, if the observation is caused by only one charge 5/3 quark (plus the SM 
contribution). For BP 10, we obtain mfit = 395.5 it 24.6 GeV and for IBP 18, we find 
mfit = 388.6 lb 29.7 GeV. This shows that a fit of the total distribution, including the 
contributions from the various top-partners and the SM backgrounds, leads to a systematic 
overestimate of the mass, which nevertheless remains within about la of the true xi mass. 




M|| (GeV) 

Figure 10: The invariant mass distribution 
of the SS di-lcptons for BP 10 plus the SM 
background (red), overlaid with the expected 
distribution due to a pair-produced xi quark 
with a mass mfit = 395.5 GeV (blue). The 
(blue) dashed distributions correspond to the 
variation of the xi expectation due to the 
uncertainty on the fitted mass of 24.6 GeV. 




Figure 11: Same distributions as on the 
left, but for IBP 18 with mtn = 388.6 ± 
29.7 GeV. 



As a next step, we calculate the cross section and simulate the expected signal for a 
pair- produced xi with the fitted masses. This signal plus the SM expectation gives the 
expected invariant mass distribution of the SS di-leptons for a given mass hypothesis. For 
BP 10 and IBP 18 (figures 10 and 11), we see that neither of the two signal distributions 
can be explained assuming the presence of only one charge 5/3 quark. In particular, we 
expect 62.5 and 6.7 SS di-leptons from BP 10 and the SM background respectively. Fitting 
the tail of the SS di-lepton invariant mass distribution, however, leads to an estimate of 
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lO.Olg'g SS di-leptons due to a pair-produced charge 5/3 top-partner. The stated errors are 
due to the statistical uncertainty on the fitted mass only. For IBP 18, 42.4 SS di-leptons 
are expected from the top-partners with 200 pb~ . Assuming the presence of xi only, the 
tail of the distribution suggests an xi mass that can account for 11.2l3'4 SS di-leptons. 
For BP 10 and IBP 18 we are left with respectively 52 and 31 unexplained SS di-leptons. 
The uncertainty on these numbers is dominated by the Poisson uncertainty of the 62.5 and 
42.4 expected events. 

The possibility of the signal to be mainly caused by a very light ti can be excluded 
in the following way. The dominant channel for ti to produce SS di-lepton events includes 
the leptonic decay of a Z, ti — )■ tZ. In this case, three leptons are produced and two OS 
leptons come from the Z. A veto on a mass window around mz for OS di-leptons thus 
helps to suppress the ti contribution to the signal. For BP 10 and IBP 18, leptonic ti 
decays account for 5% and 19% of the signal. Cutting on a window of mz ± 10 GeV results 
in a loss of about 20% of the total signal, but reduces the ti contribution by about 70%. 
Alternatively, one could directly veto tri-lepton final states to curb the contribution from 

h. 

We conclude that for both benchmark points, 200 pb~ of collision data would be 
sufficient to obtain an evident discrepancy between the total invariant mass distribution 
and the distribution based on the hypothesized presence of only one charge 5/3 top-partner. 
Such an observation could be seen as evidence in favour of a model with multiple top- 
partners. If instead the signal distribution were consistent with the expected distribution 
from rrifit, much more than 200 pb^ of collision data would be needed for the distribution 
to reveal the presence of additional, heavier top-partners. 

Beyond the 200 pb^ scenario. When more integrated luminosity has been collected 
at the LHC, advanced techniques can be used to resolve more details about the masses of 
the top-partners. The identification of either a full 4 of S'0(4) or two charge 5/3 quarks 
would be evidence in favour of our model. The signal in the SS di-lepton channel can 
be produced by various top-partners and it may be difficult to disentangle the different 
contributions. Discriminating the SS di-lepton events due to xi from the contribution due 
to bibi would be an important step. In the SS di-lepton channel, the two leptons from 
xixi decays come from the same particle, whereas in bibi decays they come one from a 
quark each. In the OS di-lepton channel, the roles of xi and 6i are exchanged. The shapes 
of the SS and OS di-lepton invariant mass distributions may help to gain insight in the 
underlying physics. 

8. Conclusions 

We reviewed a composite Higgs model and highlighted some of its most important features. 
We used vector- like fermionic resonances to reconcile the model with EWPT. This is more 
easily achieved when two sets of composites are below the cutoff of the effective theory. 
We showed that in this case the collider phenomenology is very rich, and in particular 
we can obtain some distinctive signatures for our model. These are the cases when a full 
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4 of SO {4) or two charge 5/3 top-partners lie within the reach of the LHC. We scanned 
the parameter space of the model focussing on points that are consistent with EWPT 
observables and give these signatures. For these signatures we described the possible mass 
hierarchies and outlined the basic features of their phenomenology. We find that the tri- 
lepton and same-sign di-lepton final states are the most promising ones for a discovery of 
the model. 

We studied in detail the phenomenology of two benchmark points with a large produc- 
tion cross section. Both exhibit a 4 signature and one has two charge 5/3 quarks with a 
mass below 500 GeV. We presented a robust cut-based search strategy for an excess in final 
states with at least two same-sign leptons. After making a basic kinematic selection, only 
little background from the SM was found in this channel. We find that for both benchmark 
points a few tens of inverse picobarns of understood collision data would suffice to observe 
a 5a significance. 

Since the SM contamination in the same-sign di-lepton final state is small, this chan- 
nel is not only well suited for observing an excess over the SM expectation, but also for 
reconstructing the masses of the new particles. Among the top-partners, the light charge 
5/3 quark contributes the most to the excess of SS di-lepton events. We described a new 
method to reconstruct the mass of such a quark via its leptonic decay. This method only 
relies on the reconstruction of the two same-sign leptons and exploits the shape of their 
invariant mass distribution. For both distinctive signatures of the model, the light top- 
partners besides the charge 5/3 quark also contribute to the excess of same-sign di-lepton 
events. In this case, we showed how the mass reconstruction method could be used to judge 
if the excess of same-sign di-lepton events is compatible with the presence of a charge 5/3 
quark only, or if it hints at the existence of additional top-partners. For this, we used 
the fact that the cross section for pair production of top-partners can be predicted as a 
function of mass. Already with an integrated luminosity of 200 pb~ and a corresponding 
statistics of about 50 signal events, we found an evident disagreement between the single xi 
hypothesis and the expected observation. Such a disagreement can be seen as an indication 
for the presence of top-partners in addition to a charge 5/3 quark. 
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A. Discovery potential of the 30 benchmark points 

In table 7, we list the number of expected events for the 30 benchmark points in each 
of the four considered final states. We also give the corresponding background confidence 
level 1 — CLb arising from a combined search in the four channels. We use the log likelihood 
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ratios (as defined in section 6.2) as a test statistic. We indicate if the central 1 — CLb value 
corresponds to an excess of at least 3a or 5cr. 



BP 21 SS + 3j 21 SS + 4j 31 OS + 2j 31 OS + 3j 



1-CLb 



IBP 1 


7.22 


5.82 


4.45 


4.12 


1.3 


10^6 


+2.2 
-1.3 


10-i 
10-6 


> 3(7 


hBP 1 


3.76 


3.40 


2.35 


2.53 


2.2 


10-3 


+4.1 
-2.2 


10-2 
10-3 




IBP 2 


6.70 


5.59 


4.13 


3.42 


5.9 


10-6 


+6.8 
-5.9 


10-"* 

10-6 


>3a 


hBP 2 


4.64 


4.24 


2.68 


2.71 


4.8 


10-4 


+1.5 
-4.7 


10-2 

10-4 


>3a 


IBP 3 


12.70 


9.67 


8.49 


7.27 


* 1 


10-9 




> 5a 


hBP 3 


6.61 


5.87 


4.22 


4.18 


2.4 


10-6 


+3.4 
-2.4 


lO-'' 

10-6 


> 3(7 


IBP 4 


8.10 


7.01 


4.65 


4.38 


7.9 


10-8 


+3.2 
-7.9 


10-^ 

10-8 


> 5(7 


hBP 4 


2.94 


2.85 


1.70 


1.78 


1.1 


10-2 


+1.1 
-1.1 


10-1 

10-2 




BP 5 


6.97 


5.65 


3.93 


3.80 


3.7 


10-6 


+4.9 
-3.7 


10-"* 

10-6 


>3(7 


BP6 


2.91 


2.59 


1.73 


1.78 


1.4 


10-2 


+1.2 
-1.3 


10-1 

10-2 




BP 7 


3.06 


3.03 


1.73 


1.81 


9.1 


10-3 


+1.0 
-8.8 


10-1 
10-3 




BP8 


2.98 


2.81 


1.50 


1.52 


1.4 


10-2 


+1.3 
-1.3 


10-1 

10-2 




BP9 


7.19 


5.66 


3.81 


3.68 


3.8 


10-6 


+4.9 
-3.8 


10-"* 

10-6 


>3(7 


BP 10 


14.62 


10.87 


9.14 


7.59 


* 1 


10-9 




> 5(7 


IBP 11 


10.37 


8.63 


5.56 


5.40 


* 1 


10-9 


+2.7 


10-^ 


> 5(7 


hBP 11 


5.70 


5.08 


3.32 


3.63 


3.4 


10-5 


+2.4 
-3.4 


10-3 
10-5 


>3(7 


IBP 12 


7.87 


5.87 


4.20 


3.53 


1.5 


10-6 


+2.3 
-1.5 


10-4 

10-6 


>3(7 


hBP 12 


8.42 


7.30 


4.86 


4.72 


3.7 


10-8 


+1.3 
-3.7 


10-5 

10-8 


> 5(7 


BP 13 


12.98 


10.16 


8.74 


7.38 


* 1 


10-9 




> 5(7 


BP 14 


10.44 


8.83 


6.59 


6.74 


* 1 


10-9 


+2.2 


10-8 


> 5(7 


IBP 15 


7.99 


6.11 


4.12 


3.83 


8.4 


10-7 


+1.5 
-8.4 


10-4 

10-7 


> 3(7 


hBP 15 


6.89 


5.71 


4.24 


3.99 


2.5 


10-6 


+3.6 
-2.5 


10-4 
10-6 


> 3(7 


IBP 16 


8.57 


6.87 


5.74 


4.86 


2.3 


10-8 


+8.6 
-2.3 


10-6 
10-8 


> 5(7 


hBP 16 


2.32 


2.21 


1.42 


1.34 


3.2 


10-2 


+2.0 
-3.0 


10-1 

10-2 




IBP 17 


9.15 


6.72 


5.44 


4.83 


2.0 


10-8 


+8.2 
-2.0 


10-6 
10-8 


> 5(7 


hBP 17 


2.43 


2.17 


1.35 


1.40 


3.2 


10-2 


+2.0 
-3.0 


10-1 

10-2 




IBP 18 


10.03 


7.48 


6.53 


5.09 


* 1 


10-9 


+6.8 


10-7 


> 5(7 


hBP 18 


4.42 


3.77 


2.67 


2.44 


9.7 


10-4 


+2.5 
-9.6 


10-2 
10-4 


> 3(7 


BP 19 


7.31 


5.42 


4.09 


3.83 


3.2 


10-6 


+4.2 
-3.1 


10-4 
10-6 


> 3(7 


BP 20 


6.49 


5.06 


3.79 


3.75 


1.3 


10-5 


+1.1 
-1.3 


10-3 
10-5 


>3(7 



Table 7: Number of expected events in each of the four channels for 200 pb-^ of integrated 
luminosity. The background confidence level 1 — CLb with its uncertainty is also given. The 1 — CLb 
values marked with (*) correspond to benchmark points for which more than 10^ pseudocxperiments 
would be needed for the tail of the tPDF of the background hypothesis to leak out of the integrated 
region. 
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